The Stackelberg equilibrium for one-sided zero-sum partially observable stochastic games

Authors

Abstract

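Formulating cyber-security problems with attackers and defenders as a partially observable stochastic game has become a trend recently. Among them, the one-sided two-player zero-sum partially observable stochastic game (OTZ-POSG) has emerged as a popular model because it allows players to compete for multiple stages based on partial knowledge of the system. All existing work on OTZ-POSG has focused on the simultaneous-move scenario and assumed that one player’s actions are private in the execution process. However, this assumption may be questionable, since one player’s action may be detected by the opponent through deploying action detection strategies. Hence, in this paper, we propose a turn-based OTZ-POSG with public actions and investigate the existence and properties of a Stackelberg equilibrium for this game. We first prove the existence of the Stackelberg equilibrium for the one-stage case and show that the one-stage game can be converted into a linear-fractional programming problem and therefore solved by linear programming. For multiple stages, the main challenge is the information leakage issue: the public run-time action reveals certain private information to the opponent and allows the opponent to achieve more rewards in the future. To deal with this issue, we adopt the concept of ε-Stackelberg equilibrium and prove that this equilibrium can be achieved for finite-horizon OTZ-POSGs. We propose a space partition approach to solve the game iteratively and show that the value function of the leader is piece-wise linear and the value function of the follower is piece-wise constant for multiple stages. Finally, examples are given to illustrate the space partition approach and show that the value functions are piece-wise linear and piece-wise constant.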
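
The step from a linear-fractional program to a linear program mentioned in the abstract can be illustrated with the standard Charnes-Cooper transformation. The block below is a generic sketch, not the paper's own formulation: the symbols c, d, α, β, A, b, x, y and t are assumptions introduced here for illustration, and the sketch assumes the denominator is strictly positive on a bounded feasible region.

```latex
% Generic linear-fractional program (illustrative symbols, not from the paper).
% Assumption: d^T x + \beta > 0 for every feasible x, and the feasible set is bounded.
\[
  \max_{x}\ \frac{c^{\top}x+\alpha}{d^{\top}x+\beta}
  \quad\text{s.t.}\quad Ax\le b,\ x\ge 0 .
\]
% Charnes-Cooper substitution:
\[
  t=\frac{1}{d^{\top}x+\beta},\qquad y=t\,x .
\]
% Equivalent linear program in the variables (y, t):
\[
  \max_{y,\,t}\ c^{\top}y+\alpha t
  \quad\text{s.t.}\quad Ay-bt\le 0,\quad d^{\top}y+\beta t=1,\quad y\ge 0,\ t\ge 0 .
\]
% Recovering the original solution from an optimum (y^*, t^*) with t^* > 0:
\[
  x^{*}=\frac{y^{*}}{t^{*}} .
\]
```

Under these assumptions, the objective and constraints of the transformed problem are linear in (y, t), so any linear programming solver can be applied.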

Related resources

Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games

We study two player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. The players have partial observability, however, the evader is given an advantage of knowing the current position of the units of the pursuer. We show that (1) value functions of this game depend only on the position of the pursuing units and the belief the pursuer has about the position o...

Dynamic Programming for Partially Observable Stochastic Games

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form r...

Definable Zero-Sum Stochastic Games

Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...

Planning for Weakly-Coupled Partially Observable Stochastic Games

Partially observable stochastic games (POSGs) provide a powerful framework for modeling multi-agent interactions. While elegant and expressive, the framework has been shown to be computationally intractable [Bernstein et al., 2002]. An exact dynamic programming algorithm for POSGs has been developed recently, but due to high computational demands, it has only been demonstrated to work on extrem...

Dynamic Programming Approximations for Partially Observable Stochastic Games

Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes with a price, namely a high computational cost. Solving POSGs optimally quickly becomes intractable after a few decision cycles. Our main contribution is to provide bounded approximation techniques, which enable us to sca...

Journal

Journal title: Automatica

Year: 2022

ISSN: 1873-2836, 0005-1098

DOI: https://doi.org/10.1016/j.automatica.2022.110231